Lecture 5: Introduction to (Robertson/Spärck Jones) Probabilistic Retrieval

Authors

  • Ellis Weng
  • Andrew Owens
Abstract

In this lecture, we introduce our second paradigm for document retrieval: probabilistic retrieval. We focus on Robertson and Spärck Jones’ 1976 version, presented in the paper Relevance Weighting of Search Terms. This influential paper was published when the Vector Space Model was first being developed, so it is important to keep in mind the differences and similarities between the two models and the motivations for each. Recall that the Vector Space Model was originally a representation model; its retrieval scheme was empirically driven and chosen in a fairly atheoretical manner. In contrast, probabilistic retrieval is more principled and theoretically driven. Even so, many statistical estimations and empirical substitutions will drive the derivation of this paradigm.

Related resources

Chapter 1 LANGUAGE MODELLING AND RELEVANCE Karen

1. Introduction This paper addresses three questions about the Language Modelling (LM) approach to information retrieval. These questions are about LM and relevance. They arise because relevance has always been taken as fundamental to information retrieval (see, e.g. Saracevic, 1975, or Mizzaro, 1997). Thus from the standpoint of retrieval theory, the presumption has been that as relevance is t...

Lecture 10 – March 2 Lecturer : Prof Lillian Lee Scribes : Jerzy Hausknecht & Kent Sutherland More on Language Models

In the previous lecture, we discussed the idea of relevance models, as presented in [Lavrenko & Croft 01]. For each query, a language model for relevance is constructed. The final product is a language model based on a collection of documents. The final model estimation details were very similar to query likelihood, even though the relevance model was derived from the ideas in [Robertson & Spär...

A probabilistic model of information retrieval: development and comparative experiments - Part 1

The paper combines a comprehensive account of a probabilistic model of retrieval with new systematic experiments on TREC Programme material. It presents the model from its foundations through its logical development to cover more aspects of retrieval data and a wider range of system functions. Each step in the argument is matched by comparative retrieval tests, to provide a single coherent acco...

A Survey On Re-Ranking of Images

There is a large body of research on image search re-ranking. This diverse work needs to be collected to give a fuller picture of the area. This paper presents a survey of various techniques used for image re-ranking. First, it introduces object queries, which give result images specific to some kinds of objects, and retrieval models. In next s...

Wearing proper combinations

This paper discusses the proper treatment of multiple indexing fields, representations, or streams, in document retrieval. Previous experiments by Robertson and his colleagues have shown that, with a widely used type of term weighting and fields that share keys, document scores should be computed using term frequencies over fields rather than by combining field scores. Here I examine a wide ran...

Publication date: 2010